Automatic Naming of Speakers in Video via Name-Face Mapping
نویسندگان
چکیده
The problem of automatically labelling the appearances of characters in video with their names is challenging due to the huge variation in the appearance of each character and the weakness and ambiguity of available annotations. We can achieve high precision by combining multiple sources of information, both visual and textual. The principal novelties that we introduce in this paper are: (i) extracting face features in video by neural network; (ii) strengthening the mapping between names and faces by analyzing the co-occurrence of names and faces; (iii) automatically and efficiently labelling appearances of main characters with their names.
منابع مشابه
Video-based face recognition in color space by graph-based discriminant analysis
Video-based face recognition has attracted significant attention in many applications such as media technology, network security, human-machine interfaces, and automatic access control system in the past decade. The usual way for face recognition is based upon the grayscale image produced by combining the three color component images. In this work, we consider grayscale image as well as color s...
متن کاملUnsupervised naming of speakers in broadcast TV: using written names, pronounced names or both?
Persons identification in video from TV broadcast is a valuable tool for indexing them. However, the use of biometric models is not a very sustainable option without a priori knowledge of people present in the videos. The pronounced names (PN) or written names (WN) on the screen can provide hypotheses names for speakers. We propose an experimental comparison of the potential of these two modali...
متن کاملHello! My name is... Buffy'' -- Automatic Naming of Characters in TV Video
We investigate the problem of automatically labelling appearances of characters in TV or film material. This is tremendously challenging due to the huge variation in imaged appearance of each character and the weakness and ambiguity of available annotation. However, we demonstrate that high precision can be achieved by combining multiple sources of information, both visual and textual. The prin...
متن کاملAutomatic word naming recognition for an on-line aphasia treatment system
One of the most common effects among aphasia patients is the difficulty to recall names or words. Typically, word retrieval problems can be treated through word naming therapeutic exercises. In fact, the frequency and the intensity of speech therapy are key factors in the recovery of lost communication functionalities. In this sense, speech and language technology can have a relevant contributi...
متن کاملFacial Expression Recognition Based on Anatomical Structure of Human Face
Automatic analysis of human facial expressions is one of the challenging problems in machine vision systems. It has many applications in human-computer interactions such as, social signal processing, social robots, deceit detection, interactive video and behavior monitoring. In this paper, we develop a new method for automatic facial expression recognition based on facial muscle anatomy and hum...
متن کامل